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ABSTBACT 

Using conventional mental test theory, item 
parameters of an aptitude or achievement test vary with each group of 
examinees, and as such are somewhat limited in their use and 
interpretation. Within the last 25 years, measurement models have 
emerged in which item parameters are considered to be invariant. 
Generically referred to as latent trait mental measurement or item 
characteristic curve theoj.y, these models have precipitated the 
development of sophisticated tailored testing technigues which 
capitalize on the item paraiieter invariance feature* This paper 
examines the invariance feature for the Birnbaum 2 and 3 parameter 
logistic models. The investigation is in two parts, t^e first 
theoretical, the second empirical* Iven with somewhat inadequate 
data, t.he empirical investigation supported the theoretical 
considerations. (luthor/EC) 
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A Closer Lrok at Latent K-ait 
ParaiTeter Invarianra 



Using OTiventional nental test theory^ iten i^raireters of an aptiti^a 
or adiiey^nent test vary with eadi groi^ of examdjiees, and as audi are mcxm^ 
vAiat Itadtad in their use int^^retation ^ Within ttm past 25 years ^ 
nBasurement iradels ha^ emerged iji v^idti itmi j^rgmeters Me ronsitored to 
be invariarit (Lord^ 1952^ tosch, I960; Bimbaim^ 1968; Urry, 1970)* toi^i-- 
cally refened to as "latent trait mental rneasu^OTKit" or "itan diaracteristic 
curve theory," these nto^ls ha.VB precipitated the fevelopment of sophisticabed 
tailored testing tedhniqi^s whicii capitalize on tiie item parameter invarianoe 
feature. 

In tills paper ^ I will exardne this invariant feature for the Birr±>aum 2 
and 3 parameter logistic rnodels (BiCTtoijii, 1968) , The investigation will te 
In two parts, the first thMretiMil, the seoond CTpirical, 

A Theoretical E^amnation of Paranieter Invariant 
Bo"di mDdels jnder Mnsideration have b^n developed in the literature 
(Jens^ra/ 1972; Lord ^d Novickf 1968), md will rat be fully descrttied here. 
The ItCTi diaracteristic cnarve (ICC) in Figure 1 shows, for BirnbaOT^s 3 para^- 
meter logistic irodel, Uie regression of ability 8, against thm prdQability 
of a ODrrect re^onse, 

P{0) ^ % + (l^cg) [1 + mxp i-l,lag{Q-bg))r^ (1) 

The auUior is :uidebted to Carl Jmrnma, ^^in, and Jolm Con,^y for 
thair assistant witii tiie first draft of this report, to tte Office of 
Demographic studies, Gallaia&t College, Washijigton, D,C,, for sillying the 
data used in the study, and to Ja^u^lijie Cox and Joyoe Jtobinson for t^ing 
the mmiuscript. 



for a particular ita^ gj where 

-ttie iten ^ffic^-^^ is tiie ^int of djiflertJ.oi incaioa-^^g the iTOa- 
tion of the aurvm aloog toe 0 axisr 

Uie itan diacrmination^ is directly relabM to tha slc^ of the 

rarvs at i I ' 

ff - 

Ggf the pr^ability of qmBmmq tiie oomect :re^Qnse, is toe lo^rer bb^s^ 
tote of the curve. 

In the BtoAaxm 2 parainster logiBtic mofel^ ^"^^ itsns, 

equation 1 beocrea 

PCQ) ^ [1 + ^ ^"^^'7% (2) 
The paranretara b , a , and a sxm oonsicteEBd inwriant in toat the shape 
Of any givm IOC (for a p^tioiLar itmm) is toTOretically ijivariant aczoie 
popudatioiis and across testa* Ttmt iSf if an it^i ia trmaplanted into a 
M.fferent test of the same ability or administered to a different gro^p of 
esMd^eea ,. the shape of the ICC mil not ch^ge* 

The nia:nerical values for ag and bg, howe^^r^ are dependent i^on the scale 
used to define particvdar values of For lade off a natwal metric ^ toeae 
values of Q are UBually represented as deviations from the ineai true ability 
in a pre^tested p^ulation. Thus, if itmm pi^atfneters bxb separately callbratoi 
on two grotps of e^OTdneea vSioae ^ility distrJiutiona ^ffer, different 
nwerical TOlyea to reprea^t each Ug and bg 'parameter will be Stained,. TOils 
situation is illustrate in Pigtae 2, ^ 

Ite two rigid 8 acales may be represen^rf b^' and Let 6^', 
and represent toe nurreric \^uea cbtained i^liig the 0' scale, and let 
^g^' r represent toe valies i;sing toe Q" scale « Clewly, toe 

■.. . 4. 
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^g-^^g'' and ^g-f^^g'* In ttiaoEy, i^™e^mr, becai^e of diffic^ti^ 

of c^fcaining good Gg paraneter estlmtes, tiiis ^mot be aKpeotM in practice, 

Cimsi&r an eKardnee ^ose abili^ is and v^KS is a rrentoar of 

both groi5>s. If tiie sh^e of Uie IOC is ind^d inrajrimt^ this exardn^ will 
have the sate probabilitv of responding »rreotly regardless of © soale. 
ThSLt is, 

P(04) - P(0*) (3) 
utera Q* and are the numerics used to dsscribe on tiie Q' ^d 8»l#s 
re^sectively* 

N©v numerical \ml«s for a and i sl^uld be predictable witii eadi- 
dimge in Uie 0 s^le* I will dBrive md dtscuss sudi preMctiQn K[uations 
for both the Bi^rta^^ n 2 parOTS/ter and the BirrtoauTi 3 ^raneter logistic models^ 
and finally ot.11 describe an ^^irical investigation, 
BirnbaTO'g 2 ParanTSter Model 

By fefinitim, (eq[uation 2) 

PC©;;) = [1 + e>qp (-l*7a^*M0^-&^"))]~^ 
and 

P(Qi) = [1 ei^ H1.7a '(0i^5 
Considering 0" and 0' not ^[ual to md ^^S* substitutijig into (3) and 
Averting/ g^talni 

1 + eKp H-^7ag''i^-b^n) = 1 + e3q> M.7a^' (S^l-i^')) 
Subtracting 1 and taking the natural l^orithiriei 

Then 
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(5) 



Carsts (A) are u^d to indiGate that theea are predlcfced values, Nota 
that both (4) and (5) are liriear relatiQndiips * The sane "^^s of linear 
relationships hold for Lord's 2 parameter normal c^ive ire^l (sto Lord ^d 
Nbviok, 1968, Chester 16,11)* 
Bimbami's 3 Par^ieti^ Mo<tel 

In th^ry^ tkm values for ag md bg do not diange v4^i a guessing para^ 
neter, ogf is ad(ted. As mrada, ^uations 4 ^d 5 ^ould hold for the ttoae 
paraireter logistic model as well, HQ%^?aver^ the question will be altered 
allow^g for Qg^fhg". 

First 04 and ei are selected audi ttiat P(0i)=P(©*)^K, 0<K<1, Thm, 
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respictivsly. TSien 
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= 1.7 
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,3 ^ *9 (9) 



Agaiji, thase are linMr fmctioM, Note that v^&n q^* - K' ^ K" 

and ^uations 8 and 9 reduce to ^^tions 4 m)A 5 resE^tively, 

DlaQusslon 

^is dOTOnstration 1ms ^rmm tiiat mtti both tl^ BlrriDavan 2 and 3 p^a- 
met^ modelSi (l) a^" is a lu^ar fi^ictien of a^' mtti IntCTCept equal to zero 
and (2) b " is a linear fimctiOT of ' witii intercept not nesassacily equal 
to zero* 

In aquations 4 and 8, one niay rote ttiat 

(10) 

is a f mction solely of to« iretrics^ q' and Q", used to desariie 0^ aM 
find ©i' and fc^". miahf (10) and ronaajumtly (4) aM (5) constat for 
all itOTs mth^ tiie test and (8) (9) will vary slightly i&*ien batwean 
grpup gt^sing paranietacs are unequal. 

With the BBBch rrDdel, Qg' ^ ^ 0 md a^' ^ ag*' for all ibans, Md (10) 
is again constant. As with tte Bi^toum model i the difficul'ty (easiimas) 
parameter of the Pasch ncdel can also b© m^mtrndi to be nimBrically invariant 
over popTjdations, Thie is pr«isely Vihat ^darson/ et al* (1968) and Tinsley 

and Dawis {1972) fourd in empirical tovestigations # 

'. " . . . . .. . .. i - " ; - ' ' ' • ■ : 

I have shown ttat tte numerical values for andfc of both BirriDaim 
irodela change linear ily as a function of chaise mi rtetric. The pararoters 
are invar i^t in that tiiaoretically arm can alv^ys recalibra'te the Qg and 
b numeric values to a different S metric without any loss in inta^retation. 



An B^irlcal InVBStigatlon of Parareter Inv^ian€» 
The paraneter iri'^^ianra has b^n ^Msidered thMratiGally in the pra- 
oedlng pages ^ In this part of the paper, it will be treated erpiri^lly* 
Specif iMuLly, I will shw Urnt a dian^ in tte © inetric will precipitate linear 
dianges in the and bg vali^s. 
Proaedige 

Item raeponsee to ttie Level II Spellljig SiiDtest of the Stanford Adu^eve^ 
ment Test^ H^rdjig--topaired V^Bion (SA^ffiC) by 2,609 stiiaente were made 
available ttoou^ the oDurtesy of the Of fire of DOTo^aphlc St\3cU.es, Gallaudet 
Colleger Wa^iijigtQft, D*C* Eada of the 47 itara mUu^ this subtest prDvides 
coe ooroect and thr^ alternative re^cffiseSp 

Ttm 2,609 cases ymrm a part of a steatified random sanple of 6,872 stu- 
dianto to the Unitad States' special edueatlon prograTO for tte hewing^ittpaired 
t^ctog the ^tire ^T-Hl battery in tiim sprtog of 1974. Stratification was 
oonduoted on type of program aid enrol^iBnt size of tiim ijistitution* 

The ^cOTdnMS were dim^d into tro grou^ based en (^"OTOlogiral kga 
on tiie day of testing. Ite younger ^o^, gm\jp L, oontaijied 1,415 sifcjeatsi 
^e older groi:p, gmup H, 1,194, 

P^aineter estijtiates were oalibrated on ea^ groi^ separately usijig an 
iterative niinnjTOm diL square estwation prooediTC devel^ed Urry (1975) , 
As suggested by Sctoiidt and Gugel (1975), a mtoinMn of 1,000 subjects and 60 
items are needed to obtaiji satisfactory paraineter estimtes witii the Urry 
procedure * SinQe the number of itOTS avmilable on the SAT-^ffi is less ttian 
60, estimates from both groins may contoin aansv^at more rmito^ error than 
reocnir^fed. 

■ ■ . ^ - - 
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date were analyzM to test the null h^»thesas that (1) with ^tim 
intaroept set ^lal to zero, there is no strong Itoear r^^pessioii of ^ ^ 
ag, and (2) ttere is no strong linear re^reeeicm o£ on Fairly restrio- 
tiva R .75 tots soi^ht. To provide additional infomaticm, ttase recessions 
vere plotted (see Pigurea 3 and 4) and the prt^urt^^rannent cttroelatiais of 
obsarved md predicted parOTtBtsr values oonpubed. 

ReSTjJ.ts 

In the proceee of paranTSterizatiQn , 16 md 2 evtojecta wme topped from 
^oi^s L and H re^ectivmly, because Q estimates were imattein^le. ItCTi 
paranetar estJjnates w^e obtained on a total of 41 items ratroon to both groins, 
Ttmsm estJjnatea and tiie 4| md the ^ valine are etown in Table 1. The 
bivMiata productHrommt oorrelatiOTs betwem kg and and betw^n and 
are .677 md ^929, raspectively* 
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Table 1 



Obsaryed mSi PredLabBd and Values 



m a 41-ItCTi test 



PREDICTS 




H 






L 










1^1,194 






1^1,415 




A 

y 


K 

if 




9 


4 


g 




4 


1.57 


0.18 


1.70 


-0.02 


0.21 


1,49 


0,91 


0.24 


2.17 


-1.27 


2.03 


-1.24 


0.04 


2,06 


-0.74 


0.07 


1.43 


-0.68 


1.68 


-0.73 


0.10 


1 -36 


-0.07 


O.U 


1.33 


-1.16 


1.24 


-0.92 " 


0.06 


1.26 


-0.62 


0.10 


1,69 


-1.07 


1.73 


-1.08 


0.05 


1*60 


-0,52 


0,24 


1.26 


-0.54 


1.46 


-1.08 


0.06 


1,20 


0.09 


0.23 


1.72 


-1.12 


2.13 


-0.98 


0.02 


1.63 


-0.57 


0.09 


1.36 


-0.56 


1.54 


-0.71 


0.05 


1.29 


0.06 


0.17 


1.30 


0.32 


1.28 


-0.01 


0.16 


1,23 


1.06 


0,22 


1.65 


-0.11 


1.32 


-0.08 


0.14 


1,57 


0.58 


0,24 


0.85 


-0,13 


1.02 


-0,51 


0.13 


0.81 


0.55 


0,27 


1.17 


-0.37 


0.89 


-0.49 


0.11 


1.11 


0.28 


0,29 


0.77 


-0.16 


0.71 


0.27 


0.16 


0,73 


0,52 


0*12 


1.17 


-0.92 


1.42 


-0,84 


0.05 


1,11 


^0,35 


0.09 


0.76 


-0 • 79 


0.95 


-0.63 


0.05 


0.72 


-^0.20 


0.07 


1.22 


-0.06 


1.08 


-0.34 


p. 10 


1,16 


0.63 


0.21 


1.19 


0.09 


1.40 


-0.33 


6.16 


1,13 


0.80 


0,26 


1.26 


-0.27 


1.25 


-0.35 


0.22 


1.20 


0.39 


0.22 


1.06 


-0.16 ' 


1.05 


-0.22 


0.05 


1.01 


0.52 


0.14 


1.36 


-0.92 


1.48 


-0.89 


0.06 


1,29 


-0 . 35 


0.09 


2.00 


0.37 


1.77 


0.28 


0.16 


1*90 


1.12 


0.17 


1.88 


0.07 


1.15 


0.06 


0.15 


1.78 


0.78 


0.23 


1.02 


-0.96 


0.77 


-0.84 


0.05 


0.97 


--0.39 


0.09 


1.12 


-0.48 


1.42 


-0,58 


0.10 


1.06 


0.16 


0.14 


1.72 


-1.00 


1.30 


-0.85 


0.06 


1,63 


-^0,44 


0.08 


1.13 


-0.57 


1.16 


-0.45 


0.05 


1.07 


0.05 


0.08 


1.22 


-0.69 


1.21 


-0.65 


0.05 


1.16 


--0.08 


0.10 


1.60 


-0.51 


1.72 


-0.47 


O.IO 


1.52 


0.12 


0.17 


1.23 


0.52 


1.20 


0.33 


0.18 


1.17 


1.29 


0.26 


2.27 


1.19 


1.63 


1.72 


0.41 


2.15 


2,05 


0.32 


1.41 


-0.72 


1.27 


-0.82 


0.06 


1.34 


-0.12 


0.12 


1.25 


-0.70 


1.20 


-0.79 


0.06 


1.19 


-^0.10 


0.08 


1.17 


0.02 


1.89 


0.26 


0.14 


1,11 


0.72 


0.20 


1.42 


-0.23 


1,93 


-0.25 


0.09 


1.35 


0.44 


0.13 


0.70 


0.40 


0.75 


0.41 


0.12 


0.66 


1,15 


0.16 


1.39 


-0.29 


1.67 


-0.09 


0.19 


1,32 


0.37 


0.14 


1.60 


1.10 


2.17 


1.18 


0.26 


1.52 


1.95 


0.29 


1.39 


-0.16 


1.51 


-0.08 


0.22 


1.32 


0.52 


0-23 


0.99 


0.46 


1.12 


0.06 


0.18 


0.94 


1.22 


0,26 


1.83 


0.12 , 


1.56 


0,54 


0.23 


1.74 


0,84 


0-21 


1.21 


0.45/" 


1.28 


0.88 


0.26 


1,15 


1.21 


0.24 
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T^le 2 shows the siimiMY tableB for a b ragraBsims. It should 

be noted that tiie and bg ^grassionB aocomited for large ^roentageB of 
the paranstor y^ianoe (96% and 86% respertively) , IhuB, one can reject -Uie 
hypotheses that there is no strong linear relation^iip betwem valuBB with 
the Intercept set to zero md tii^ne is no l^e^ relation^iip biti^?Mn 

ft^ ^raluBs at R^>,75. 

' T^le 2 

Regxeseion StranMy Tables 

Regressical of on ^ Remession of on b~ 

(fixed interoept) 

mitipla R .97976 ^ftiltlple R .92919 

R S^iarad .95993 R ^aared .86339 

' Stmdard Error ' ; Stmd^d ECTor ^ 

of Estiination ,29121 of Estdination ,23883 

Slope (Unstandard^ Slope - .87973 

Ized) 1,05408 toterc^ 61600 

aiter^^ .00000 



The regressions of ag on aid of bg^bg are ill^trated in Figures 3 

_ ij _ 

md 4, The solid luies represent tte prediobed wlues of a and b and the 

. - ff 

dott^ lines repressit the 95% oonfidmce interval (±1,96 times S*E, est,) , 
Discussion 

Altiiough tile parOTieter estimates' nay cbontain icotm tlan the" fedofiiTiandB 
aniQUfits of error varianoe^ tiie prasmoe of strong Itoear relationships has 

sham between mli^s and beto^^i &^ values calibrated oti differ- 

^t Q scales. In effect^ this dKronstrates emdenoe for tiie inwrimoe of 
tiie discrimination and difficult lat^t -^ait pararetars, &vmi with £ ^ger 
itsT^ "^an reconnenfed. 



kn ^served value deviatijig f rem a predicts wlte l^di^bes Cflie or worm 
of the follovdngi 

1. a violation of the midunension^ity .reetri 

2. a "Eolation of tiie Icxal indepen^moe restrictioii, 

3. a poor paranieter eetiinate vAi&a callbratad cn one or botii groins. 
In light of the test size, the non-perfeot oorrelatims and deviations 

fran the prediction lines are perh^s attributoble to the third reason abo^* 
With existing paraniBter estiJiation pTOgrams, the estimates constantly 
contain more randcm error than the bg estMatee, In ad^ticn, there is sore 
restriction mi range mth regi^d to Uus par^neter* Henoe, the lower oonre-- 
lation for tiie a parfflne^s is to te e^c^. 

Conclusion 

In this paper, I ha^ developed the p^arneter in^wimce concept for the 
BirnbaOTi 2 and 3 pararneter logistic models, to a tteoretli^l level, I stas/^ 
that tiie numeric used to describe ag and bg Lm ^pendent i^n tte 0 scale 
definition, aid that ^Uirough Imear^uations, new valije Ug aid bg can be 
pre^cted vftai the 0 scale is redefined, 

Eyim mtS SOTi&^Aiat ina^qpate.^^ si^orted 
Uie theoretical considerations. At oonservati^^ H^lues, ttie sti^ty danon^ 
strated that a chaige in tiie 0 meteic precipitated linear dianges in toe 
vali^s ^d in the 6^ val«s. '^m predicted \mli^s of ag and correlate 
vdtii, and did not overly demate from, yie observed vali^s of and and 
l^ge perc^tages of the paranieter variances were e^lam^ by the linear 
^[imtions, 

15 



T]m presenoe of these linear functions si^ports the validity of the mD^l 
for test and for -Uie axar^ed eairples. TOmt is/ aBsomng that say_Bfaat03y 
p^anietar estJ^tes can be dbtalii^/ this prooedupa can bm used to chedc for 
violations of the midijrensionality and local indeprndenoe latent teait theory 
restrictions, Ttm presoioe of Mar ^rfect luiear ralatiOTSii^ bet^^een 
sanples would indirata that tiie resteictiona have been met * Ite absOTce 
betv^^en saniples wuld suggest ei-^er a restriction violatiai or poor para-- 
ireter estimates* 
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